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Abstract^ 

It  is  shown  here  that  the  equivalence  class  of  an  expression  under  the  congruence  closure  of  any 
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There  arc  several  areas  in  which  it  is  important  to  be  able  to  "solve"  for  certain  expressions.  Hie 
most  familiar  example  is  in  applied  mathematics  where  one  is  interested  in  "solving"  a  system  of  simultaneous 
equations.  A  related  problem  is  that  of  answering  data  base  queries  of  the  form  "what  is  x?”.  In  general  one 
is  presented  with  a  collection  of  terms,  some  methods  for  showing  equivalences  between  terms,  and  the 
problem  of  "solving  for”  or  "simplifying"  some  particular  term. 

One  motivation  for  algorithms  which  solve  for  expressions  arises  from  procedural  attachment 
Consider  a  data  base  consisting  of  a  collection  of  first  order  sentences  and  suppose  that  the  binary  function 
symbol  +  is  intended  to  denote  ordinary  addition  over  the  integers.  Clearly  it  is  possible  to  directly  compute 
the  sum  of  two  numerals  and  thus  one  can  benefit  from  associating  the  function  symbol  +  with  such  an 
addition  procedure  (such  procedural  attachment  in  FOL  is  described  by  Wcyhrauch  [1]).  However  to  apply 
the  addition  procedure  to  a  term  such  as  i|y]|  one  must  solve  for  its  subterms,  in  this  case  flx|  and  l|y], 
in  terms  of  numeric  constants.  Thus  the  usefulness  of  procedures  attached  to  function  or  predicate  symbols 
can  depend  upon  the  ability  to  solve  for  expressions. 

Another  motivation  for  algorithms  that  solve  for  expressions  arises  from  Moore’s  notion  of  a  rigid 
designator  |2].  Moore  addresses  the  philosophical  question  of  when  an  individual  (or  a  data  base)  "knows" 
the  meaning  of  some  term.  For  example  when  docs  an  individual  "know"  John’s  phone  number.  Moore 
addresses  this  question  by  first  defining  a  modal  logic  with  a  Kripke  style  semantics  and  then  defining  the 
notion  of  a  "rigid  designator”  as  a  term,  such  as  the  numeral  2,  which  has  the  same  meaning  in  all  possible 
worlds.  Moore  then  defines  "knowing  the  meaning  of  a  term"  such  that  a  person  knows  the  meaning  of  the 
term  phone# [John]  just  in  case  there  is  a  rigid  designator  d  such  that  the  person  can  deduce  that 
phonc#[John]  equals  d.  The  details  of  Moore's  constructions  are  not  important  here  other  than  to  note  that 
he  makes  a  distinction  between  terms  which  may  denote  different  things  in  different  worlds  and  rigid 
designators  which  can  not.  Now  consider  a  data  base  consisting  of  sentences  in  Moore's  logic  and  a  question 
of  the  form  "what  is  phone #(John]?".  Intuitively  one  should  expect  the  system  to  cither  answer  "I  don’t 
know",  or  to  return  a  rigid  designator  such  as  the  numeral  2537884.  Thus  the  problem  of  answering  a  query 
of  the  form  "what  is  x?”  can  be  reduced  to  the  problem  of  "solving”  for  x  in  terms  of  a  rigid  designator. 

In  general  suppose  that  some  set  of  symbols  has  been  identified  as  "independent"  (or  rigid  in 
Moore’s  sense)  and  all  other  symbols  are  considered  "dependent".  The  problem  addressed  here  is  that  of 
taking  a  finite  set  of  equalities  2  between  ground  terms  and  a  particular  term  u  and  "solving"  for  u  in  terms  of 
the  independent  symbols  by  performing  substitutions  of  equals  for  equals.  As  an  example  consider  the  three 
equations  a  =  mm  b  =  Ha]  and  c  =  fib]  and  suppose  that  the  symbols  f  and  c  arc  taken  to  be 
independent.  It  is  possible  to  solve  for  b  in  terms  of  f  and  c,  specifically  b  =  flflcl|.  As  another  example 
consider  the  equations  a  =  Ijb  c|,  b  =  gja  c),  and  c  =  f]g|si  c]  c|  where  g  and  c  arc  taken  to  be  independent. 
It  follows  from  these  equalities  that  b  =  g]c  cj. 

Ihc  only  technique  for  showing  equivalences  used  by  the  procedures  described  here  is  the 
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substitution  of  a  ground  expression  (or  an  equivalent  ground  expression.  I  he  problem  of  solving  lor  an 
expression  using  substitution  as  the  onh  means  of  showing  equivalences  may  seem  like  a  very  special  case  of 
the  problem  of  solving  for  an  expression  using  arbitrary  techniques  lor  show  ing  equivalences.  However  the 
substitution  of  equals  for  equals  can  play  an  important  role  in  cases  where  other  techniques  for  showing 
equivalences  arc  also  sound.  For  example  consider  a  set  of  simultaneous  linear  equations  which  can  be 
’'solved"  using  some  standard  matrix  inversion  procedure.  Steele  and  Sussman  [3]  propose  an  alternative  to 
the  standard  matrix  inversion  techniques  which  they  call  "constraint  propagation".  Constraint  propagation 
can  be  more  efficient  than  matrix  inversion  when  the  matrix  of  coefficients  is  sparse.  Constraint  propagation 
also  seems  to  model  the  way  people  often  solve  sets  of  equations.  In  constraint  propagation  a  set  of  equations 
is  expanded  to  a  larger  set  by  solving  each  equation  locally  for  each  variable  appearing  in  that  equation. 
There  is  then  a  "propagation"  phase  based  purely  on  the  substitution  of  equals  for  equals.  This 
"propagation"  phase  of  the  process  could  be  handled  by  the  algorithms  presented  here.  A  more  complete 
discussion  of  the  relationship  between  constraint  propagation  and  substitution  of  equals  for  equals  is  given  in 

14J. 

The  problem  of  solving  for  an  expression  given  a  set  of  equalities  between  ground  terms  can  be 
usefully  generalized  in  two  ways.  First  instead  of  considering  a  set  of  equations  one  can  consider  an  arbitrary 
Boolean  formula  built  up  from  equalities  between  ground  terms.  For  example  the  following  Boolean  formula 
implies  the  equality  f(x)=l. 


(x=a  V  x=b)  A  f|a|  =  1  A  flb)  =  l 

A  second  way  the  problem  can  be  generalized  is  to  consider  simplification  under  some  simplicity 
order  on  terms.  Solving  for  an  expression  is  just  a  special  case  of  simplification.  To  see  this  consider  the 
partial  order  on  terms  defined  by  making  a  term  u  "simpler"  than  a  term  v  just  in  case  v  contains  dependent 
symbols  while  u  docs  not.  Under  such  a  simplicity  order  solving  for  a  given  term  is  equivalent  to 
"simplifying"  that  term  to  an  expression  containing  only  independent  symbols. 

The  importance  of  the  more  general  problem  of  simplifying  terms  under  a  predefined  simplicity 
order  can  be  seen  by  considering  a  set  of  simultaneous  equations  in  applied  mathematics.  Numerals  and 
terms  constructed  purely  from  numerals  and  "known"  function  symbols  such  as  +  are  in  some  sense  simpler 
than  other  terms.  Often  symbolic  "constants"  are  used  in  applied  mathematics  and  terms  constructed  purely 
from  numeric  and  symbolic  constants  arc  in  some  sense  simpler  than  terms  which  contain  "variables". 
Variables  arc  sometimes  divided  into  dependent  and  independent  variables  and  terms  which  contain  no 
dependent  variables  are  in  some  sense  simpler  than  terms  which  do  not.  Also  the  size  (or  some  other  measure 
of  complexity)  can  be  an  important  factor  in  determining  a  terms  simplicity. 

(he  above  problems  arc  approached  here  by  establishing  a  relationship  between  equivalence  classes 
and  context  free  expression  grammars.  A  context  free  expression  grammar  is  just  like  a  context  free  string 
grammar  except  that  it  describes  a  set  of  expressions  (terms)  rather  than  a  set  of  strings  (it  is  important  to  note 
that  an  cxpicssion  is  a  tree  and  therefore  has  an  inherent  phrase  structure  while  a  siring  has  no  such 
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structure).  A  finite  set  2  of  equalities  between  ground  expressions  induces  a  umgruciuc  relation  on 
expressions  where  two  expressions  arc  congruent  just  in  case  they  can  be  proven  equal  from  the  equalities  in 
2.  Such  a  congruence  relation  will  be  called  a  finitely  equational  congruence  relation.  A  finitely  cquational 
congruence  relation  can  be  represented  by  a  context  free  expression  grammar.  Given  a  finite  set  2  of 
equalities  between  expressions  the  congruence  closure  algorithm  of  Downey,  Sethi,  and  l  arjan  |5|  can  be  used 
to  construct  a  context  free  expression  grammar  G  such  that  for  any  expression  u  the  equivalence  class  of  u  is 
the  language  generated  by  an  expression  form  F(u)  under  G.  Furthermore  G  can  be  constructed  from  2  in 
order  /ilog(/i)  average  time  where  n  is  the  size  of  2. 

Let  u  be  an  arbitrary  expression  and  let  Q  be  an  arbitrary  consistent  Boolean  expression  built  up 
from  equalities  and  the  standard  Boolean  connectives  A,  and  V.  Using  the  fact  that  context  free 
expression  languages  are  closed  under  intersection  it  is  possible  to  construct  a  context  free  expression 
grammar  G  which  generates  the  set  of  expressions  which  can  proven  equivalent  to  u  given  the  formula  Q. 
Ill  is  procedure  can  be  used  to  compute  a  grammar  for  the  class  of  simple  expressions  which  arc  equivalent  to 
a  conditional  expression  of  the  form  if  Q  then  u  else  v. 

Finally  a  procedure  is  developed  for  simplifying  expressions  under  an  arbitrary  well  behaved 
simplicity  order.  Let  G  be  a  grammar  such  that  the  equivalence  class  of  an  expression  u  is  the  language 
generated  by  an  expression  form  T(u)  under  G.  For  any  well  behaved  simplicity  order  there  is  a  subset  G’  of 
the  productions  of  G  such  that  the  language  generated  by  T(u)  under  G’  is  precisely  the  set  of  simplifications 
of  u.  The  subset  G’  can  be  computed  from  G  in  order  nlogfn)  time  plus  the  time  required  to  do  order  nlogfn) 
comparisons  between  expressions  where  n  is  the  size  of  G. 

It  is  hoped  that  the  ability  to  represent  finitely  equational  congruence  relations  with  context  free 
expressions  grammars  will  provide  both  a  deeper  understanding  of  such  congruence  relations  and  a  more 
flexible  computational  framework  in  which  to  perform  deduction. 
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2.  EXPRESSIONS,  EQUATIONS,  AM)  GRAMMARS 


The  algorithms  described  here  work  just  as  well  on  second  order  terms  as  on  first  order  terms.  Thus 
expressions  which  arc  intended  to  denote  functions  or  predicates  arc  not  handled  any  differently  from 
expressions  denoting  domain  elements.  In  fact  the  algorithms  described  here  do  not  depend  on  any  "typing" 
at  all.  Ihis  observation  motivates  the  following  definition  of  the  set  of  all  expressions  over  an  alphabet  A. 

Definition:  An  expression  over  an  alphabet  A  is  either  a  symbol  in  A  or  an  n-tuplc 

<u  |  uj ...  u„>  of  expressions  over  A. 

This  definition  of  an  expression  is  similar  to  the  definition  of  an  s-expression  in  LISP.  While  all  of 
the  algorithms  described  here  operate  on  expressions  they  work  just  as  well  when  restricted  to  typed 
expressions  or  first  order  terms. 

Consider  a  set  of  equalities  2  between  expressions.  If  2  was  a  set  of  equalities  between  first  order 
terms  then  it  would  be  clear  what  equalities  follow  from  2.  However  since  expressions  are  in  some  sense 
more  general  than  first  order  terms  an  explicit  definition  of  the  set  of  equalities  which  arc  dcducible  from  2  is 
given  below  (it  is  based  on  the  standard  deductive  properties  of  equality).  There  is  a  simple  semantics  (which 
will  not  be  presented  here)  for  expressions  under  which  the  following  notion  of  "dcducible"  is  both  sound 
and  complete. 

Definition:  The  set  of  equalities  deducible  from  a  set  of  equalities  2,  is  the  smallest  set  which 

contains  2  and  which  satisfies  is  the  following  deductive  principles: 

Reflex ivity :  For  any  expression  u,  u  =  u  is  dcducible. 

Symmetry:  If  u  =  v  is  dcducible  from  2  then  v  =  u  is  dcducible  from  2. 

Transitivity:  If  both  v  =  w  and  u  =  w  arc  dcducible  from  2  then  u  =  v  is 
dcducible  from  2. 

Subsiitutivity:  If  the  equalities  Uj  =  vi>  u2  =  Vj, ...  un  =  vn  arc  all  dcducible 
from  2  then  the  equality  <Uj  U2 ...  un>=<vj  v2  •••  vn* is  Reducible  from  2. 

The  following  definitions  and  lemma  2.1  provide  some  basic  concepts  relating  sets  of  equalities  to 
equivalence  relations  on  expressions.  While  these  notions  may  seem  obvious  and  redundant  they  facilitate 
precision  in  later  sections. 

Definitions:  Let  ~  be  an  equivalence  relation  on  expressions,  lhc  relation  ~  will  be  said 

to  subsume  an  equality  u  =  v  just  in  ease  u  is  equivalent  to(v  under  j.c.  just  in  ease  u~v. 

The  relation  25  will  be  said  to  subsume  a  set  of  equalities  2  just  in  ease  it  subsumes  every 
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equality  in  2.  The  relation  ~  is  called  a  connivence  relation  just  in  case  it  is  substitute  c.  i.e. 
whenever  the  equivalences  U|~vj,  iij^v-,.  ...  un~\n  hold,  the  equivalence 
<uj  u-, ...  un>SS<vj  V2  ...  vfi>  also  holds. 

lemma  2.1 :  Congruence  relations  arc  deductively  closed,  i.e.  if  ~  is  a  congruence  relation 
and  2  is  a  set  of  equalities  subsumed  by  5;  then  any  equality  deduciblc  from  2  is  also 
subsumed  by  ss. 

The  following  definition  provides  another  characterization  of  the  set  of  equalities  deduciblc  from  a  given  set 

2. 

Definition:  The  congruence  closure  of  a  set  of  equalities  2  is  the  equivalence  relation 
on  expressions  which  subsumes  exactly  those  equalities  which  are  deduciblc  from  2,  i.e. 
u~£v  just  in  case  the  equality  u  =  v  is  deduciblc  from  2.  The  equivalence  class  of  an 
expression  u  under  the  congruence  closure  of  2  is  denoted  |u|j>.  The  congruence  closure  of 
a  finite  set  2  will  be  called  a  finitely  equalional  congruence  relation. 

It  is  interesting  to  note  that  |u|j  can  be  an  infinite  set  even  when  2  is  finite.  For  example  if  2  is 
{<f  a>  =  a}  then  |a|^  includes  all  expressions  of  the  form  <f  <f  ...<f  a>...».  The  main  result  of  this  section 
will  be  that  |u|£  can  be  described  by  a  context  free  expression  grammar  as  defined  below. 

Definition:  Let  A  be  an  alphabet  of  terminal  symbols  and  let  A'  be  a  collection  of 
non-terminal  symbols  such  that  N  is  disjoint  from  A.  A  context  free  expression  grammar 
over  A  and  A  is  a  set  of  productions  of  the  form  X=>a  where  X  is  a  non-terminal  symbol 
and  a  is  an  expression  over  A  union  N. 

Expression  grammars  are  very  much  like  string  grammars.  Each  non-terminal  symbol  of  an 
expression  grammar  generates  a  set  of  expressions  over  the  terminal  alphabet  in  the  same  way  that  each 
non-terminal  of  a  context  free  string  grammar  generates  a  set  of  strings.  For  example  the  grammar  consisting 
of  the  productions  A=»a  and  A=*<f  A>  describes  the  set  of  expressions  of  the  form  <f  <f...<f  a>...». 

Let  A  be  a  set  of  terminal  symbols  and  A  be  a  set  of  non-terminal  symbols.  Expressions  over  A 
(expressions  containing  only  terminal  symbols)  will  be  called  terminal  expressions  (or  simply  expressions)  and 
will  be  denoted  by  the  letters  u,  v,  and  w.  Expressions  over  A  union  N  will  be  called  expression  forms  and  will 
be  denoted  by  the  greek  letters  a,  ft.  and  y.  For  a  given  grammar  G  the  relation  =>*  is  defined  on  expression 
forms  as  the  smallest  relation  satisfying  die  following  three  properties: 
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1)  For  any  expression  form  a,  «=>*<«. 

2)  If  X=>«  is  a  production  ofG  then  X=>*a. 

3)  If  then  for  any  expression  form  <yj  ...  a  ...yn>  containing  a, 

<Y1...a...Yn>=>*<y1...^...Yn> 

Intuitively  a=>*/3  just  in  ease  /?  can  be  derived  from  a  by  replacing  some  number  of  non-terminal 
symbols  in  a  by  expressions  which  they  generate.  The  language  generated  by  an  expression  form  a  is  defined 
to  be  the  set  of  terminal  expressions  u  such  that  a=**u. 

Ibc  investigation  of  context  free  expression  grammars  is  motivated  by  the  existence  of  a  relationship 
between  a  certain  class  of  such  grammars  and  finitely  equational  congruence  relations  on  expressions.  ITte 
following  definition  identifies  the  relevant  class  of  grammars. 

Definition:  A  context  free  expression  grammar  will  be  called  normal  if  the  following  two 
conditions  hold.  First  all  productions  are  either  of  the  form  X=>a  where  a  is  a  terminal 
symbol,  or  of  the  form  X=><Y  j  Y2 ...  Yn>  where  each  Yj  is  a  non-terminal  symbol.  Second 
there  are  no  two  productions  X=>a  and  such  that  X  and  Y  are  distinct  non-terminals 
but  such  that  a  and  fi  are  the  same  expression  form. 

It  turns  out  that  any  normal  grammar  G  represents  a  congruence  relation  on  expressions.  In  general 
all  expressions  generated  by  a  given  expression  form  a  will  be  equivalent  under  this  relation.  However  not  all 
expression  forms  generate  entire  equivalence  classes.  In  particular  if  a=>*fi  then  the  language  generated  b; 
ft  may  be  only  a  proper  subset  of  the  language  generated  by  a.  It  turns  out  that  die  expression  forms  which 
generate  entire  equivalence  classes  are  precisely  the  maximal  expression  forms  where  maximal  is  defined  as 
below: 

Definition:  An  expression  form  a  will  be  called  maximal  under  an  expression  grammar  G 
just  in  ease  there  is  no  expression  form  fi  other  than  a  such  that  /?=>  *a  under  G. 

Consider  the  cquivalcncc.class  of  the  expression  <g  a>  under  the  congruence  closure  of  the  equation 
<fa>=a.  T  he  following  normal  grammar  "represents"  the  congruence  relation  on  expressions  imposed  by 
this  equality  (the  first  "production"  below  represents  in  the  standard  way  the  pair  of  productions  A=>a  and 
A=»<F  A». 

A  =>  a  |  <F  A> 

F  =>  f 

The  equivalence  class  of  <g  u>  under  the  equality  <f  a>  =a  is  exactly  the  language  generated  by  the 
expression  form  <g  A>  under  the  above  grammar.  Note  that  the  expression  forms  A  and  <g  A>  arc  maximal 
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under  thisgramniai  while  the  expicvsion  form  <f  A>  is  1101. 

For  any  function  T  on  expressions  let  denote  the  equivalence  relation  such  that  u~  |  v  just  in 
case  Hu)  equals  Rv).  l-urtheiinore  for  any  equivalence  relation  ~  on  expressions  and  any  expression  \  let 
Mj-j  denote  the  equivalence  class  of  v  under  ~.  Ihe  following  theorem  establishes  the  (list  half  of  the 
relationship  between  normal  expression  grammars  and  finitely  cquational  congruence  relations. 

I.enuna  2.2:  I.ct  G  be  any  normal  context  free  expression  grammar.  For  each  expression  u 
there  exists  a  unique  expression  form  Ru)  which  is  maximal  under  G  and  which  generates  u. 

Thus  there  exists  a  unique  function  R  called  the  maximal  generator  function  of  G.  which 
maps  each  expression  u  to  the  maximal  expression  T(u)  which  generates  u.  Furthermore 
is  a  finitely  cquational  congruence  relation  and  for  any  expression  u  the  language 
generated  by  Ru)  is  precisely  (ul^  . 

proof:  The  function  T  is  defined  recursively  via  the  following  conditions: 

1)  For  any  terminal  symbol  a  if  there  is  a  production  of  the  form  X=^a  then  Ra)  is  X  otherwise 
f(a)  is  a. 


2)  If  u  is  of  the  form  <Uj  u2  ...  uR>  then  if  there  is  a  production  of  the  form 
X=><f(u|)  f(u2) ...  f(un)>  then  f(u)  equals  X  otherwise  Ru)  equals  <Ru^)  Ru2) ...  f(un)>. 


Since  the  normality  of  a  grammar  ensures  that  no  two  distinct  productions  have  the  same  right 
hand  side,  the  above  definition  is  well  formed.  Note  that  Ru)  can  be  computed  in  linear  lime  in 
the  size  of  u  (in  general  membership  in  a  context  free  expression  language  can  be  determined  in 
linear  time  although  this  will  not  be  proven  here). 

A  simple  induction  on  expressions  can  be  used  to  show  that  for  any  expression  u,  Ru)=>*u 
(the  details  arc  left  to  the  reader).  It  will  now  be  shown  by  induction  that  for  any  expression  u.  Ru) 
is  maximal.  To  sec  this  first  note  that  in  a  normal  grammar  each  non-terminal  symbol  is  a  maximal 
expression  form  (the  only  way  a  non-terminal  symbol  X  could  fail  to  be  maximal  is  if  there  was 
some  other  non-terminal  symbol  Y  such  that  Y=>X  was  a  production  of  G.  but  this  is  not  allowed 
in  normal  grammars).  Now  if  a  is  a  terminal  symbol  then  either  Ra)  is  a  non-terminal  symbol 
(which  is  a  maximal  expression  form)  or  Ra)  is  a.  But  if  T(a)  is  a  then  by  the  definition  of  T  there 
can  be  no  production  of  G  whose  right  hand  side  is  a  in  which  case  a  is  a  maximal  expression  form. 
Now  let  u  be  of  the  form  <uj  u2  ...  un>  and  assume  that  for  each  Uj,  Ruj)  is  maximal.  If  Ru)  is  a 
non  terminal  symbol  then  it  is  maximal,  on  the  other  hand  if  Ru)  is  not  a  non-terminal  symbol 
then  it  is  an  n-tuplc  of  maximal  expression  forms  which  is  not  the  right  hand  side  of  any  production 
and  is  therefore  maximal. 

The  proof  that  for  any  expression  v  there  is  at  most  one  maximal  expression  form  which 
generates  v  is  done  by  induction  on  v.  For  a  terminal  symbol  a  if  there  is  no  production  of  the  form 
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\=>a  in  the  grammar  then  a  is  die  unique  maximal  expression  form  vs  Inch  gene  rules  a.  On  the 
other  hand  if  there  is  a  production  of  the  form  X=^a  then  since  the  grammar  is  normal  there  is  only 
one  such  production  and  the  non-terminal  symbol  X  is  the  unique  maximal  expression  form  which 
generates  a.  Now  let  v  be  an  expression  of  the  form  <v  j  V2  ...  vn>  such  that  for  each  \  j  there  is  at 
most  one  maximal  expression  form  which  generates  Vj.  Since  T( s- )  is  maximal  and  generates  v=  it 
the  unique  such  maximal  expression  form.  Now  there  arc  two  cases.  The  first  case  occurs  when 
there  is  a  non-terminal  symbol  X  such  that  X=>*v.  Since  the  grammar  is  normal  it  must  contain  a 
production  of  the  form  X=»<Xj  X2 ...  Xn>  where  each  Xj  is  a  non-terminal  symbol  such  that 
Xj=>*vj.  However  since  each  non-terminal  symbol  is  maximal,  Xj  must  equal  Rvj)  and  therefore 
the  production  X=><r(Vj)  T^)...  T(vn)>  is  in  the  grammar.  This  must  hold  for  every 
non-terminal  symbol  X  which  generates  v,  i.e.  for  each  X  which  generates  v  the  above  production 
must  be  in  the  grammar.  However  since  there  can  be  at  most  one  production  whose  right  hand  side 
is  <r(Vj)  r(v2) ...  r(vn)>  there  can  be  at  most  one  non-terminal  symbol  which  generates  v.  Note 
that  in  this  case  the  expression  form  <T(Vj)  T^) ...  T(vn)>  is  not  maximal.  The  second  case  occurs 
when  there  exists  a  maximal  expression  form  a  which  generates  v  but  which  is  not  a  non-terminal 
symbol,  i.e.  there  exists  an  expression  form  a  which  generates  v  and  is  of  the  form  <oj  aj ...  an>. 
Since  a  is  maximal  each  a-x  must  also  be  maximal  and  since  a  generates  v,  « j  must  generate  vj  for 
each  i.  However  since  there  is  at  most  one  maximal  expression  generating  Vj,  aj  must  be  T(v-)  so  a 
must  be  <r(vj)  T^)...  r(vn)>.  Note  that  in  this  case  <T(Vj)  r(v2) ...  T(vn)>  is  maximal  and 
therefore  the  first  and  second  case  can  never  occur  at  the  same  time.  So  cither  there  is  a  unique 
non-terminal  symbol  which  generates  v  or  there  is  no  such  non  terminal  and  <F(V|)  r(v2) ...  T(vn)> 
is  the  unique  maximal  expression  which  generates  v. 

The  relation  is  substitutive,  i.e.  is  a  congruence  relation.  To  sec  this  let  u  be  an  expression 
of  the  form  <Uj  U2  ...  un>  and  let  v  be  any  expression  of  the  form  <vj  V2  ...  vn>  .  It  suffices  to  show 
that  if  T(Uj)  equals  T(Vj)  for  each  i,  then  T(<Uj  U2  ...  un>)  equals  T(<vj  ...  vn>).  However  this 
follows  directly  from  the  definition  off. 


To  establish  that  the  language  generated  by  F(u)  is  precisely 


first  note  that  if  v  is  in 


|u|~p-  <-c.  if  T(v)  equals  f(u),  then  since  f(v)= 


►  *v  it  must  be  the  case  that  T(u)= 
other  hand  if  r(u)=»*v  then  since  there  is  at  most  one  maximal  expression  form  which  generates  v. 


T(u)  equals  T(v)  and  therefore  v 


is  in  lul. —  . 
1 


Finally  it  must  be  shown  that  the  congruence  relation  defined  by  T  is  finitely  cquational,  i.e. 
that  there  is  a  finite  set  of  equalities  2  such  that  ~p  is  the  same  as  A  non-terminal  symbol  X 
will  be  called  coherent  if  the  language' generated  by  X  is  not  empty.  A  production  or  expression 
form  will  be  called  coherent  if  it  contains  only  coherent  non-terminal  symbols.  Fach  coherent 
non-terminal  symbol  X  can  be  associated  with  some  expression  u(X)  such  that  X=>*u(X).  Now  for 
each  coherent  expression  form  a,  u(a)  is  defined  as  the  result  of  replacing  each  non-terminal 
symbol  X  by  u(X);  clearly  a^’ufa).  Now  let  2  be  the  set  of  equalities  of  the  form  u(X)=  11(a) 
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where  \=>a  is  a  coherent  production  of  G.  Now  if  u(X)-u(o)  is  an  equality  in  X  then  \ 
generates  both  u(X)and  11(0)  tinderG.  Thus  if  u(X)  =  11(a)  is  an  equality  in  X  then  l'(u(X))  equals 
r<u(«»  and  therefore  u(X)~|.u(a).  Thus  every  equality  in  X  is  subsumed  by  and  since  ~p 
is  a  congruence  relation  every  equality  dcducible  from  X  is  therefore  also  subsumed  by  It 
remains  only  to  show  that  every  equality  subsumed  by  ~p  is  dcducible  from  X.  This  is  done  by 
showing  via  a  standard  induction  on  expressions  that  for  any  expression  v  the  equality  v  =  u(T(v))  is 
dcducible  from  2  (the  details  arc  left  to  the  reader).  Thus  if  the  equality  w  =  v  is  subsumed  by  ~  p 
then  since  T(w)  equals  T(v)  and  the  equalities  v  =  u(r(v)>  and  w  =  u(T(w))  arc  both  dcducible  from 
2,  the  equality  w  =  v  is  dcducible  from  2. 

The  following  lemma  completes  the  relationship  between  normal  grammars  and  finitely  cquational 
congruence  relations  by  showing  that  every  finitely  equational  congruence  relation  can  be  represented  by  a 
normal  grammar. 

Lemma  2.3:  For  any  finite  set  of  equalities  2  there  is  a  normal  context  free  expression 
grammar  G  with  i  iximal  generator  function  T  such  that  ~p  is  the  same  as 
Furthermore  G  can  be  computed  from  2  in  order  nlog(n)  average  time  where  n  is  the  total 
size  of  2.  , 

proof:  Let  D  be  the  finite  set  of  expressions  contained  in  2  either  directly  or  as  a  subexpression  of 
some  expression  contained  in  2.  Downey,  Taijan.  and  Sethi  [5J  show  how  to  construct  a  function  F 
on  D  such  that  for  any  two  expressions  u  and  v  in  D,  F(u)  equals  F(v)  just  in  ease  the  equality  u  =  v 
follows  from  2.  Furthermore  a  tabular  representation  of  the  function  F  can  be  computed  in 
nlog(n)  average  time  where  n  is  the  total  size  of  2.  Such  a  function  F  can  be  translated  into  a 
normal  context  free  expression  grammar  G  in  the  following  way:  Hach  expression  v  in  D  is 
associated  with  a  non-terminal  symbol  X(v)  such  that  for  any  two  expression  u  and  v  in  D.  X(u) 
equals  X(v)  just  in  case  F(u)  equals  F(v).  Now  for  each  terminal  symbol  a  in  D  we  include  the 
production  X(a)=*a  and  for  each  expression  v  in  D  of  the  form  <\  j  V2  ...  vn>  we  include  the  the 
production  X(v)=><X(vj)X(v2) ...  X(vn)>.  The  grammar  G  can  be  computed  from  F  in  time 
proportional  to  the  total  size  of  2. 

To  show  that  G  is  normal  it  is  sufficient  to  show  that  no  two  distinct  productions  have  the  same 
right  hand  side.  Let  u  and  v  be  two  expressions  in  I)  such  that  the  production  corresponding  to  u 
has  the  same  right  hand  side  as  the  production  corresponding  to  v.  It  is  sufficient  to  prove  that  X(u) 
must  equal  X(v)  since  then  the  production  corresponding  u  would  be  the  same  as  the  production 
corresponding  to  v.  Since  the  right  hand  sides  of  the  productions  corresponding  to  u  and  v  arc  die 
same  either  u  and  v  arc  both  the  same  terminal  symbol  (in  which  ease  the  result  is  trivial)  or  u  is  the 
form  <u  j  U2  ...  un>.  v  was  of  the  form  <Vj  V2  ...  vp>,  and  X(uj)  equals  X(Vj)  for  each  i.  However  in 
the  latter  ease  each  equality  Uj  =  Vj  would  be  dcducible  from  2  and  therefore  by  substitutivily  the 
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equality  u  =  v  would  also  he  deduciblc  from  2  .ind  thus  \(u)  would  equal  X(v). 

It  remains  to  show  that  ~  p  is  tlie  same  as  ~v  where  I'  is  the  maximal  generator  function  of 
G.  f  irst  note  that  if  u  =  v  is  an  equality  in  2  then  u  and  v  are  in  I)  and  X(u)  must  equal  X(v). 
l;urthcrmorc  it  is  easily  shown  by  induction  that  for  any  expression  u,  if  u  is  in  I )  then  Ru)  equals 
X(u).  Thus  if  u  =  v  is  an  equality  in  2  then  Ru)  equals  Rv)  and  tints  ~p  subsumes  2. 
Furthermore  lemma  2.2  ensures  that  ~p  is  a  congruence  relation  and  therefore  subsumes  any 
equality  deduciblc  from  2.  It  remains  only  to  show  that  any  equality  subsumed  by  ~p  is 
deduciblc  from  2.  To  show  this  first  note  that  for  each  non-terminal  symbol  Y  in  G  there  is  at  least 
one  expression  w  in  D  such  that  Y  equals  X(w).  Thus  each  non-terminal  symbol  Y  can  be 
associated  with  an  expression  u(Y)  in  D  such  that  X(u(Y))  equals  Y.  Now  for  any  expression  form 
a  let  u(a)  be  the  result  of  replacing  each  non-terminal  symbol  Y  in  a  by  u(Y).  It  can  now  be  shown 
by  a  standard  induction  that  for  any  expression  w  the  equality  w  =  u(Rw))  is  deduciblc  from  2  (the 
details  arc  left  to  the  reader).  Finally  if  an  equality  w  =  v  is  subsumed  by  ~p  then  since  Rw) 
equals  Rv),  and  the  two  equalities  w  =  u(Rw))  and  v  =  u(Rv))  are  both  dcducible  from  2  the 
equality  w  =  v  can  also  be  deduced  from  2. 


A  set  of  expressions  l.  will  be  called  a  context  free  expression  language  if  there  is  a  context  free 
expression  grammar  G  and  a  non-terminal  symbol  X  of  G  such  that  L  is  the  language  generated  X  under  G. 
Now  let  G  be  any  normal  grammar  and  let  T  be  the  maximal  generator  function  of  G.  Note  that  for  any 
expression  u,  if  Ru)  is  a  non-terminal  symbol  of  G  then  lu^  is  a  context  free  expression  language. 

Furthermore  if  Ru)  is  not  a  non-terminal  symbol  then  one  can  simply  construct  a  new  grammar  G’  by  adding 


a  new  non-terminal  symbol  X  and  the  production  X=>Ru)  so  that  |u|~^  is  the  language  generated  by  X 
under  G'.  In  either  case  is  a  context  free  expression  language.  These  observations  and  the  above 

lemmas  now  yield  .a .  main  result  of  this  section: 


Theorem  2.4  For  any  finite  set  of  equations  2  and  any  expression  u,  |u|^  is  a  context  free 
expression  language. 


At  this  point  it  is  useful  to  introduce  another  simple  example.  Consider  the  following  three 
equations: 


<f<f<fa»>  =  a 
<fa>  =  b 
<fb>  =  c 


These  equations  correspond  to  the  following  normal  grammar: 
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A  =>  a  |  <1-  C> 
li  =*  b  |  <K  A> 

C  =>  c  |  <F  B> 

F=>f 

Under  this  grammar  T(<g  <f  b»)  equals  <g  C>  and  the  equivalence  class  of  <g  <f  b»  equals  the  set 
of  expressions  generated  by  <g  C>.  What  expressions  arc  there,  if  any.  which  arc  both  equivalent  to  <g  <f  b» 
and  contain  only  the  symbols  g,  f  and  a?  'Hie  answer  is  the  set  of  expressions  generated  by  the  expression 
form  <g  C>  under  the  following  subset  of  productions: 

A  =*  a  |  <F  C> 

B  =*  <F  A> 

C  =>  <F  B> 

F  =>  f 

In  general  consider  a  division  of  the  terminal  symbols  into  "dependent"  and  "independent" 
symbols.  The  following  theorem  provides  a  technique  for  solving  for  expressions  in  terms  of  independent 
symbols. 


Theorem  2.5:  Let  G  a  normal  grammar  with  maximal  generator  function  T,  and  let  G’  be 
that  subset  of  the  productions  of  G  which  do  not  contain  any  dependent  symbols.  For  any 
expression  u  the  language  generated  by  Tfu)  under  G’  is  precisely  the  set  of  expressions 
which  are  equivalent  to  u  under  ~p  and  which  do  not  contain  any  dependent  symbols. 

Proof:  Any  expression  generated  by  T(u)  under  G’  contains  no  dependent  symbols  and  is 
equivalent  to  u  under  i.e.  is  a  solution  for  u.  On  the  other  hand  any  solution  v  for  u  must  be 
equivalent  to  u  and  therefore  must  be  generated  by  T(u)  under  G.  But  in  this  ease  since  v  docs  not 
contain  any  dependent  symbols  the  derivation  of  v  from  T(u)  must  not  involve  any  productions 
which  contain  dependent  symbols.  Thus  v  must  be  generated  by  f(u)  under  G’  as  well. 
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3.  BOOLIAN  CONGRl  K\(  I!  RI  GA  I  IONS 


Now  that  a  good  representation  has  been  developed  for  equivalence  classes  under  finitclv  equational 
congruence  relations  we  turn  our  attention  to  Boolean  congruence  relations.  An  equational  Boolean  formula 
over  an  alphabet  A  is  a  formula  which  is  built  up  from  equalities  between  expressions  over  A  using  negations, 
disjunctions,  conjunctions,  and  implications  in  the  standard  way.  The  congruence  closure  of  an  equational 
Boolean  formula  Q  is  defined  to  be  the  congruence  relation  ~q  on  expressions  such  that  u~q\  just  in  ease 
u  =  v  is  dcduciblc  from  Q  using  the  standard  deduction  rules  for  Boolean  connectives  and  the  deduction  rules 
for  equalities  described  earlier.  For  an  arbitrary  expression  u  the  equivalence  class  of  u  under  the  congruence 
closure  of  Q  will  be  denoted  by  |u|q.  It  will  be  shown  in  this  section  that  |u|q  is  either  the  universal  relation 
or  a  context  free  expression  language. 

Let  P  be  an  equational  Boolean  formula  which  is  a  conjunction  uj  =  \  j  A  u2*v2  A  u^  =  vj  ...  of 
equalities  and  negations  of  equalities  and  let  2  be  the  set  of  equalities  which  appear  in  positive  form  in  P. 
Using  the  techniques  described  in  the  previous  section  one  can  compute  a  normal  expression  grammar  G  with 
maximal  generator  function  F  such  that  ~p  is  the  same  as  Now  the  conjunction  P  is  satisfiablc  just  in 
ease  there  is  no  negated  equality  Uj*Vj  appearing  in  P  such  that  the  equality  Uj  =  Vj  is  deducible  from  2,  or 
equivalently  such  that  TfUj)  equals  r(vj).  If  the  conjunction  is  unsatisfiable  then  anything  follows  from  P  and 
thus  tiic  congruence  closure  of  P  is  the  universal  relation.  If  the  conjunction  P  is  satisfiablc  then  the 
congruence  closure  of  P  is  just  the  congruence  closure  of  2  and  for  any  expression  u,  |u|p  equals  |u|^  which 
equals  the  language  generated  by  T(u)  under  G. 

Any  equational  Boolean  formula  can  be  converted  to  disjunctive  normal  form.  i.c.  is  equivalent  to  a 
disjunction  of  conjunctions  of  equalities  and  negations  of  equalities.  Let  Q  any  equational  Boolean  formula 
and  let  Pj  V  P2  V...V  Pn  be  a  disjunctive  normal  form  of  Q  where  each  Pj  is  a  conjunction  of  equalities  and 
negations  of  equalities.  An  equality  u  =  v  follows  from  Q  just  in  case  it  follows  from  each  Pj.  In  other  words 
is  the  intersection  of  the  ~p  ’s.  But  by  the  above  remarks  the  congruence  closure  of  a  given  Pj  is  cither 
the  universal  relation  or  the  congruence  closure  of  the  positive  equalities  in  that  conjunction.  These  remarks 
lead  to  the  following  lemma: 

l  emma  3.1:  The  congruence  closure  of  any  equational  Boolean  formula  is  either  the 

universal  relation  or  a  finite  intersection  of  finitely  equational  congruence  relations. 

Given  the  above  observations  it  is  clear  that  one  can  decide  whether  an  equality  u  =  v  follows  from  a 
formula  Q  by  first  converting  0  to  disjunctive  normal  form  and  then  seeing  if  u  =  v  follows  from  each 
disjunct.  However  it  is  not  clear  that  a  mechanism  which  can  solve  for  u  under  a  set  of  equalities  can  be 
extended  to  a  mechanism  for  solv  ing  for  u  under  an  arbitrary  equational  Boolean  formula.  What  is  needed  is 
a  good  characterization  of  the  equivalence  class  of  u  under  0-  As  a  simple  example  consider  the  following 
formula  in  which  <1°  a>  is  an  abbreviation  for  an  expression  of  the  form  <f  <f...<f  a>...»  where  the  symbol  f 
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appears  n  times. 

<f5a>=a  V  <f7a>=a 

The  equivalence  class  of  a  under  this  formula  is  the  set  of  expressions  of  the  form  <fn  a>  where  n  is  a 
multiple  of  35.  One  approach  to  the  problem  of  solving  for  expressions  under  a  Boolean  formula  0  would  be 
to  try  to  find  a  finite  set  of  equalities  2  such  that  ~q  was  the  same  as  ITic  following  theorem 

demonstrates  that  this  can  not  be  done  in  general. 

Theorem  3.2:  The  intersection  of  two  finitely  cquational  congruence  relations  need  not  be 
finitely  equational. 

Proof  sketch:  Consider  the  following  pair  of  sets  of  equations: 


Zj:  {a  =  b} 

Z2:  (<fa>  =  a,  <f  b>  =  b,<ga>  =  <gb>} 

The  intersection  of  and  *s  same  as  ~q  where  Q  is  the  Boolean  formula: 

a=b  V  (<f  a>=a  A  <f  b>=b  A  <g  a>=<g  b>) 


The  relation  ~q  is  the  same  as  the  relation  where  n  is  the  infinite  set  of  equalities  of  the 
form: 

<g<fn  a»  =  <g<fnb» 

Now  ~j-j  is  finitely  equational  just  in  case  there  is  a  finite  set  of  equalities  2  such  that  is 
the  same  as  If  such  a  set  2  exists  then  if  u  =  v  is  an  equality  in  2  then  and  thus  the 

equality  u  =  v  must  be  deducible  from  n.  But  an  equality  is  dcduciblc  from  n  just  in  ease  it  is 
dcducible  from  some  finite  subset  of  n  and  thus  all  of  the  equalities  in  2  must  be  deducible  from  a 
single  finite  subset  of  n.  Thus  is  finitely  equational  just  in  case  it  is  the  congruence  closure  of 
some  finite  subset  of  n.  However  given  the  above  definition  of  II  no  finite  subset  of  H  can  imply 
all  of  the  equalities  in  n  so  is  not  finitely  cquational. 

The  above  theorem  says  that  there  is  a  Boolean  formula  Q  such  that  ~q  is  not  finitely  cquational. 
However  it  is  still  possible  that  for  each  expression  u  there  is  a  finite  set  of  equalities  2  such  that  |u|q  equals 
lul^  (remember  that  there  arc  infinitely  many  such  expressions  u). 

l  et  Q  be  an  arbitrary  equational  Boolean  formula.  By  Lemma  3.1  it  is  possible  either  to  determine 
that  the  congruence  closure  of  Q  is  the  universal  relation  or  to  find  a  finite  collection  {2|  Z2  ...  2n}  of  finite 
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scls  of  equations  such  that  the  congruence  closure  of  Q  is  the  intersection  of  the  congruence  closures  of  the 
2j's.  Thus  for  any  expression  u  we  have  the  following  relation: 

|u|q  =  lul^niul^rL.lul^ 

Furthermore  for  each  2j  there  is  a  normal  grammar  Gj  with  an  associated  maximal  generator 
function  f"j  such  that  is  the  language  generated  by  Tj(u)  under  Gj.  This  implies  that  |u|q  is  the 
intersection  of  a  finite  collection  of  context  free  expression  languages.  While  it  is  well  known  that  the 
intersection  of  two  context  free  string  languages  need  not  be  a  context  free  language  there  arc  fundamental 
differences  between  string  languages  and  expression  languages.  Specifically  expressions  have  a-priori  phrase 
structure  while  strings  do  not.  'Hiis  difference  is  responsible  for  the  fact  that  membership  in  a  context  free 
expression  language  can  be  determined  in  linear  time  (as  was  mentioned  in  the  previous  section)  and  it  is  also 
responsible  for  the  following  theorem: 

Theorem  3.3:  The  intersection  of  two  context  free  expression  languages  is  a  context  free 
expression  language. 

Proof.  An  expression  grammar  will  be  called  one  level  if  every  production  is  either  of  the  form 
X=>a  where  a  is  a  terminal  symbol  or  of  the  form  X  =>  <Xj  X2 ...  Xn>  where  each  X;  is  a 
non-terminal  symbol.  (A  normal  grammar  is  a  one  level  grammar  in  which  no  two  productions 
have  the  same  right  hand  side.)  It  is  easy  to  show  for  any  context  free  expression  language  L  there 
is  a  one  level  grammar  G  such  that  L  is  the  language  generated  by  a  non-terminal  symbol  of  G. 
Given  these  remarks  Theorem  3.3  follows  directly  from  the  following  Lemma. 

Lemma  3.4:  For  any  two  one  level  expression  grammars  G  j  and  G2  there  exists  a  grammar 
denoted  as  GjDG2  such  that  for  each  non-terminal  X  of  Gj  and  non-terminal  Y  of  G2 
there  exists  a  non-terminal  of  G  jflG2  denoted  by  XflY  such  that  the  language  generated  by 
XflY  under  GjDG2  is  the  intersection  of  the  languages  generated  by  X  and  Y  under  G  j 
and  G2  respectively. 

Proof:  Given  Gj  and  G2  define  GjHG2  to  be  the  one  level  grammar  meeting  the  following 
conditions: 

1)  The  set  of  non-terminal  symbols  of  GjflG2  is  the  cross  product  of  the  non-terminal  symbols  of 
Gj  and  the  non-terminal  symbols  of  G2.  In  other  words  for  each  non-terminal  symbol  X  of  Gj 
and  non-terminal  Y  of  G2  there  exists  a  non-terminal  symbol  of  GjDG2  which  will  be  denoted 
here  by  XflY. 


2)  For  each  terminal  symbol  a  a  production  XHY  =*  a  is  in  Gj(TG2  just  in  case  the  production 
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X  =>  a  is  in  G  j  and  the  production  V  =>  a  is  in  G-). 

3)  A  production  of  the  form  XflY  =>  <XjnYj  X2HY2  ...  XnflYn>  is  in  G]nG2  just  in  case  the 
production  X  =>  <X  j  X2 ...  Xn>  is  in  G  j  and  the  production  Y  =>  <  Y  j  Y2  ...  Yn>  is  in  G2. 

Let  u  be  any  expression  over  terminal  symbols.  It  will  first  be  shown  by  induction  on  u  that  if 
XnY=»*u  then  X=**u  and  Y=**u.  If  u  is  a  terminal  symbol  then  this  result  follows  directly  from 
the  definition  of  Gjr^.  If  u  is  of  the  form  <uj  U2 ...  un>  then  if  XDY=>  *u  then  there  must  be  a 
production  ofG|DG2  of  the  form  XflY  =>  <XjnYj  X2f!Y2  ...XnnYn>  such  that  X-ft  Y  j=>  *uj 
for  each  i.  By  the  definition  of  GjflG2  this  implies  that  the  production  X  =>  <Xj  X2  ...  Xn>  is  in 
Gj  and  the  production  Y  =>  <Yj  Y2 ...  Yn>  is  in  G2.  Furthermore  by  the  induction  hypothesis  it 
must  be  the  ease  that  Xj=>*Uj  and  Yj=>*Uj  for  each  i.  Therefore  it  must  be  the  ease  that  X=>*u 
and  Y=»*u  and  the  induction  is  complete. 

Now  it  will  be  shown  by  induction  on  expressions  that  if  X=>*u  and  Y=>*u  then  XHY=>*u. 
Again  if  u  is  a  terminal  symbol  then  the  result  follows  directly  from  the  definition  of  G|flG2.  If  u 
is  of  the  form  <Uj  U2 ...  un>  then  if  X=>*u  and  Y=>*u  there  must  be  a  production  in  Gj  of  the 
form  X  =>  <X^  X2 ...  XR>  and  a  production  in  G2  of  the  form  Y  =>  <Yj  Y2 ...  Yn>  such  that 
Xj=>*Uj  and  Yj=»*Uj  for  each  i.  But  by  the  definition  of  GjflG2  this  implies  that  the  production 
XflY  =>  ^jlTYj  X2nY2  ...  XnflYn>  is  in  GjHG2.  Furthermore  by  the  induction  hypothesis 
XjH Yj=>  *Uj  for  each  i.  Therefore  XflY  =>  *u  and  the  induction  argument  is  complete. 

Finally  since  XflY=>*u  just  in  case  X=»*u  and  Y=>*u  the  language  generated  by  XflY 
under  G^fTG2  is  exactly  the  intersection  of  the  language  generated  by  X  under  Gj  with  the 
language  generated  by  Y  under  G2. 

Theorem  2.4,  Lemma  3.1,  and  Theorem  3.3  now  immediately  imply  the  following  corollary: 

Corollary  3.5:  For  any  consistent  equational  Boolean  formula  Q  and  any  expression  u,  |u|q  is 
a  context  free  expression  language. 

For  any  two  grammars  G  j  and  G2  the  size  of  the  grammar  GjDG2  is  proportional  to  the  product  of 
the  sizes  of  the  grammars  Gj  and  G2.  However  it  seems  reasonable  to  expect  that  for  most  pairs  X,  Y  of 
non-terminal  symbols  ofGj  and  G2  respectively  the  language  generated  by  X  and  the  language  generated  by 
Y  will  be  disjoint  and  therefore  the  language  generated  by  XHY  will  be  empty.  A  non-terminal  symbol  of  a 
grammar  will  be  called  coherent  if  the. language  generated  by  that  symbol  is  not  empty.  The  coherent 
fragment  of  a  grammar  G  is  defined  to  be  the  grammar  resulting  from  removing  all  productions  which  contain 
non-cohcrent  non-terminal  symbols.  The  coherent  fragment  of  a  grammar  can  be  computed  in  time 
proportional  to  the  size  of  that  coherent  fragment. 

Ihc  fragment  of  GjnG2  which  is  relevant  to  describing  the  language  generated  by  XflY  can  be 
restricted  even  further.  A  non-terminal  symbol  Z  will  be  said  to  be  accessible  from  the  non-terminal  W  if 
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there  is  a  production  whose  left  hand  side  is  W  and  whose  right  hand  side  contains  Z.  or  if  there  is  a 
non-terminal  V  such  that  V  is  accessible  from  W  and  /.  is  accessible  from  V.  for  a  given  grammar  G  the 
fragment  of  G  accessible  from  W  is  the  subset  of  productions  of  G  which  contain  only  symbols  accessible 
from  W.  Ihc  fragment  of  G  accessible  from  W  can  be  computed  in  time  proportional  to  the  size  of  that 
fragment  If  one  is  interested  in  the  language  generated  by  XflY  under  Gjf^  then  one  need  only  consider 
the  fragment  of  GjfKI^  which  is  accessible  from  XHY. 

Of  course  the  intersection  of  a  large  set  of  context  free  expression  languages  can  be  done  by 
iteratively  applying  the  intersection  algorithm  implicit  in  the  proof  of  Lemma  3.4.  In  the  worst  case  the  size  of 
the  grammar  can  grow  exponentially  in  the  number  of  languages  which  are  intersected.  However  it  is 
expected  that  such  exponential  growth  will  not  usually  arise  since  at  each  step  one  can  restrict  the  grammar  to 
the  coherent  subset  which  is  accessible  from  the  non-terminal  of  interest 

The  mechanisms  that  have  been  described  so  far  can  be  extended  to  deal  with  conditional 
expressions.  For  a  given  alphabet  A  of  terminal  symbols  one  can  define  a  conditional  expression  to  be  either  a 
simple  expression  (a  non-conditional  expression)  or  an  n-tuplc  <if  P  Uj  U2>  where  P  is  an  cquational  Boolean 
formula  and  Uj  and  U2  are  conditional  expressions.  As  was  mentioned  earlier  it  is  straightforward  to  give  a 
semantics  for  expressions  and  Boolean  formulas.  Such  a  semantics  can  be  extended  to  a  semantics  of 
conditional  expressions  by  defining  the  denotation  of  <if  P  Uj  i«2>  to  be  the  denotation  of  Uj  if  P  is  true  and  to 
be  the  denotation  of  U2  if  P  is  false.  Now  for  a  conditional  expression  w  and  a  Boolean  formula  Q,  |w|q  is 
defined  to  be  the  set  of  non-conditional  expressions  which  equal  w  in  all  interpretations  which  make  Q  true, 
llus  definition  yields  the  following  relation: 


|<if  Puj  u2>|q  =  MilpAQ  n  lu2l(iP)AQ 

Given  the  results  of  this  section  the  above  relation  provides  a  way  of  computing  a  grammar 
representing  |w|q  for  any  conditional  expression  w  and  consistent  Boolean  formula  Q.  Note  that  since 
elements  of  |w)q  must  be  non-conditional  expressions  |w|q  may  be  empty. 
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4.  SIMPUFICATION 


A  simplicity  order  on  expressions  is  a  partial  order  such  that  u  is  simpler  than  v  just  in  ease  u  is  less 
than  v  under  that  order.  For  the  purposes  of  this  section  let  ~  be  an  arbitrary  congruence  relation  on 
expressions  and  for  any  expression  u  let  |u|~  denote  the  equivalence  class  of  u  under  Note  that  since  the 
simplicity  order  need  not  be  total  or  well  founded  there  need  not  be  a  unique  simplest  expression  in  |u| — 
1'he  following  definitions  arc  important  in  discussing  simplification  under  congruence  relations. 

Definitions:  A  simplicity  bound  under  ~  is  an  expression  w  such  that  there  is  no  expression 
in  which  is  simpler  than  w.  An  expression  w  will  be  called  simplified  under  ~  just  in 
ease  it  and  all  of  its  subexpressions  are  simplicity  bounds  under  A  simplification  of  an 
expression  u  under  ~  is  an  expression  in  |u|~  which  is  simplified  under  ~. 

The  main  result  of  this  section  is  that  for  any  "well  behaved"  simplicity  order  and  any  consistent 
equational  Boolean  formula  Q  the  set  of  simplifications  of  an  expression  u  under  ~q  is  a  context  free 
expression  language.  However  it  is  important  to  note  that  there  are  "pathological"  simplicity  orders  for  which 
this  result  does  not  hold.  In  particular  let  2  denote  the  single  equation  <f  a> =a.  Clearly  |a|^  is  die  set  of  all 
expressions  of  the  form  <l”  a>.  Now  suppose  that  the  simplicity  order  was  such  that  larger  expressions  were 
always  simpler  than  smaller  expressions.  Since  the  set  |a|^  contains  arbitrarily  large  expressions  it  would  not 
contain  a  simplicity  bound.  To  eliminate  this  problem  one  might  required  that  the  simplicity  order  be  well 
founded,  i.e.  that  there  does  not  exist  any  sequence  of  ever  simpler  expressions.  However  even  under  this 
restriction  there  are  pathological  orderings.  Suppose  for  example  that  all  expressions  other  than  a  arc  equally 
simple  and  but  simpler  than  a.  In  this  case  any  expression  of  the  form  <f”  a>  (for  n  greater  dian  0)  is  a 
simplicity  bound  for  |a|j..  However  all  expressions  in  |a|^  contain  a  which  is  not  a  simplicity  bound  and 
therefore  no  expression  in  |a|^  is  a  simplification  of  a.  The  following  definition  establishes  a  class  of 
non -pathological  or  well  behaved  simplicity  orders.  Note  that  well  behaved  orders  need  not  be  well  founded. 

Definition:  A  well  behaved  simplicity  order  on  expressions  is  defined  here  as  a  partial  order 
on  expressions  satisfying  the  following  conditions: 

1)  The  order  is  pseudo-total,  which  means  that  there  is  a  function  L  from  expressions  to  a 
totally  ordered  set  such  that  u  is  simpler  than  v  just  in  case  MuKMv).  Such  a  function  L  will 
be  called  a  totalizer  for  the  simplicity  order  and  L(u)  will  be  called  the  simplification  level  of 
u.  Note  that  not  all  partial  orders  arc  pseudo-total,  in  particular  consider  an  order  and  three 
objects  x,  y,  and  z  such  that  x  is  less  than  y  but  z  is  unordcrcd  with  respect  to  both  x  and  y.  If 
such  an  order  were  pseudo-total  then  since  z  and  x  arc  unordcrcd  Mz)  would  have  to  equal 
Mx).  Similarly  Mz)  would  equal  My).  Hut  then  I.(x)  would  have  to  equal  I.(y)  which 
conflicts  with  the  ordering  between  x  and  y. 
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2)  lhc  tprdcr  satisfies  the  following  monotutucity  condition:  A  context  is  defined  to  be  an 
expression  with  an  "open  slot"  in  it.  For  any  expression  w  and  any  context  C  let  C'|*)  he  Uie 
result  of  replacing  the  open  slot  of  C  with  w.  lor  example  if  C  is  the  context  <f  <r  a>  •  li> 
then  C|<g  h>J » the  expression  <f  <g  a>  <g  h>  h>.  lhc  monotonicity  condition  is  that  for  any 
two  expression  u  and  v  and  any  context  C.  if  I  (u)<JUv)  then  UClu|)<MCfv])  where  I.  is 
any  totalizer  for  the  simplicity  order. 

3)  So  expression  is  simpler  than  one  of  its  subexpressions. 

Conditions  on  simplicity  orders,  such  as  the  monotonicity  condition  above,  can  often  be  most  readily 
expressed  in  terms  of  a  totalizing  function  L  Such  conditions  can  always  be  thought  of  as  direct  conditions 
on  the  simplicity  order  and  therefore  the  truth  of  such  conditions  is  independent  of  the  choice  of  the  totalizing 
function.  In  the  remainder  of  this  section  L  will  always  denote  a  totalizer  for  the  simplicity  order. 

The  monotonicity  condition  on  well  behaved  simplicity  orders  warrants  some  investigation.  First  it 
is  easily  shown  that  the  monotonicity  condition  implies  that  if  Uu)  equals  Mv)  then  for  any  context  C, 
l.(C(ul)  equals  MC|vJ)  (if  L(u)  equals  L(v)  then  L(C(u])^MC(v])  and  UQvl)<UQu])).  Thus  the  level  of  an 
expression  is  determined  by  the  levels  of  its  subexpressions,  'litis  implies  that  any  well  behaved  simplicity 
order  can  be  characterized  by  a  triple  <D,  Lj,  L^>  where  D  is  a  totally  ordered  set  of  "simplicity  levels".  I.j  is 
a  function  from  symbols  to  D,  and  is  a  mapping  from  n-tuples  of  elements  of  D  into  D  such  that  the 
following  relations  hold: 


L(a)  =  Lj(a)  for  symbols  a 
M<Uj  u2 ...  un>)  =  I^Lfuj)  Mu2) . ..  Uun)>) 

Note  that  the  above  relations  could  be  taken  as  a  definition  of  I.  in  terms  of  Lj  and  Lj.  The 
condition  that  no  expression  is  simpler  than  one  of  its  subexpressions  is  equivalent  to  the  following  condition 
on  lq. 


L^KI/uj)  Uu2) ...  Uun)>)  £  maxfKuj)  Uu2) ...  l-(un)) 

Unfortunately  the  above  conditions  on  well  behaved  simplicity  orders  arc  not  quite  strong  enough  to 
ensure  that  every  expression  has  a  simplification.  There  arc  now  two  ways  of  ensuring  this,  lhc  first  is  to 
require  that  the  simplicity  order  be  well  founded.  The  second  is  to  require  that  the  congruence  relation  be 
~q  for  some  consistent  Boolean  formula  Q.  The  next  theorem  establishes  that  the  well  founded  ness 
condition  is  sufficient  for  arbitrary  congruence  relations. 

Theorem  4.1:  For  any  well  founded  well  behaved  simplicity  order  and  any  congruence 
relation  every  expression  has  a  simplification. 
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Proof:  An  expression  w  will  be  called  "cleaner"  than  an  expression  u  just  in  case  the  pair  <1  .<  w)  w> 
is  less  Ulan  the  pair  <1  .(u)  u>  under  a  lexicographical  ordering  where  die  second  components  of  the 
pairs  arc  compared  under  the  subexpression  order.  In  other  words  w  is  cleaner  than  u  just  in  case 
Uw)  is  less  than  1  (u)  or  L(w)  equals  L(u)  but  w  is  proper  a  subexpression  of  u.  Since  well  behaved 
simplicity  orders  are  well  founded  the  cleanliness  ordering  is  also  well  founded,  i.c.  there  are  no 
infinite  chains  of  ever  cleaner  expressions. 

The  proof  that  a  simplification  always  exists  will  be  done  by  induction  on  cleanliness.  For  an 
arbitrary  expression  u  it  is  assumed  that  all  expressions  cleaner  than  u  have  simplifications.  It  is 
sufficient  to  show  that  this  implies  that  u  has  a  simplification. 

If  there  is  an  expression  w  in  |u|^.  which  is  simpler  than  u  then  w  is  cleaner  than  u  and 
therefore  has  a  simplification  which  must  also  be  a  simplification  of  u.  On  the  other  hand  suppose 
that  there  is  no  expression  in  |u|~  which  is  simpler  than  u  (i.c.  u  is  a  simplicity  bound).  Because  no 
expression  can  be  simpler  than  one  of  its  subexpressions  if  v  is  a  proper  subexpression  of  u  then 
L(v)<L(u)  and  therefore  v  must  be  cleaner  than  u.  Thus  by  the  induction  hypothesis  all 
subexpressions  of  u  have  simplifications.  Let  w  be  the  result  of  replacing  all  of  the  top  level 
subexpressions  of  u  with  simplifications  of  those  expressions.  By  substitutivity  w  is  in  |u|~  and  all 
proper  subexpressions  of  w  arc  simplicity  bounds.  To  show  that  w  is  a  simplification  of  u  it  remains 
only  to  show  that  w  is  a  simplicity  bound  for  |u|~.  However  since  u  is  a  simplicity  bound  and  since 
the  monotonicity  condition  implies  that  L(w)<l.(u),  w  must  also  be  a  simplicity  bound. 

For  a  congruence  closure  ~q  of  a  consistent  Boolean  formulae  0  the  well  foundedness  condition 
can  be  removed  from  the  above  theorem.  The  first  step  in  establishing  this  result  is  to  show  that  for  any 
consistent  Boolean  formula  Q  and  expression  u  there  is  a  grammar  G  which  captures  all  of  the  information  in 
Q  relevant  to  simplifying  u.  The  precise  conditions  placed  on  the  grammar  G  is  defined  below  in  terms  of  u 
and  ~q. 

Definition:  Let  G  be  any  normal  grammar  with  maximal  generator  function  T.  The 
grammar  G  will  be  said  to  cover  an  expression  v  just  in  case  T(v)  is  a  non-terminal  symbol. 

The  grammar  G  will  be  said  to  approximate  a  congruence  relation  ~  just  in  case  |v|~  equals 
the  language  generated  by  T(v)  for  all  expressions  v  which  are  covered  by  G. 

lemma  4.2:  For  any  consistent  equational  Boolean  formula  Q  and  any  expression  u  there  is  a 
normal  grammar  G  which  approximates  ~q  and  which  covers  u. 

Proof:  It  was  demonstrated  in  the  previous  section  that  if  Q  is  consistent  then  there  exists  a  finite 
set  {2j  ^2 ...  Zn}  of  finite  sets  of  equations  such  that  the  relation  ~q  equals  the  intersection  of 
the  relations  ~£..  For  each  X|  there  exists  a  normal  grammar  Gj  with  maximal  generator  function 
Tj  such  that  25p  equals  .  It  can  be  assumed  without  loss  of  generality  that  each  Gj  covers  u. 
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Now  using  the  technique  developed  in  the  proof  of  lemma  3.4  one  can  construct  the  grammar 
GjflG2...nGn  such  that  the  language  generated  by  each  non-terminal  \|DX->...nXn  of 
GjnG>..nGn  is  precisely  the  intersection  of  the  languages  generated  by  each  Xj.  From  the 
definition  of  this  grammar  given  in  the  proof  of  lemma  3.4  it  is  easy  to  show  that  this  grammar  is 
nonnal.  To  show  that  this  grammar  approximates  ~q  let  v  be  any  expression  cosercd  by 
GjOG2...nGn.  The  non-terminal  of  GjnG2-.nGn  which  generates  v  must  be 
r1(v)nr2(v)...nrn(v)  and  the  language  generated  by  this  is  precisely  the  intersection  of  the  sets 
|v|2  which  is  precisely  |v|q.  Finally  since  u  is  covered  by  each  Gj  the  nonterminal 
f  1(u)nr2(u)...nrn(u)  generates  u  and  thus  u  is  covered  by  GjnG2...nGn. 


It  is  interesting  to  note  that  a  normal  grammar  G  can  approximate  a  congruence  relation  ~  even  if 
~  is  not  finitely  cquational.  In  particular  the  relation  ~p,  where  T  is  the  maximal  generator  function  of  G, 
need  not  be  the  same  as  if  w  is  not  covered  by  G  then  may  be  a  proper  subset  of  M-w. 


The  following  lemma  tightens  the  relationship  between  a  congruence  relation  ~  and  a  grammar  G 
which  approximates  it. 


lemma  4.3 :  If  a  normal  grammar  G  approximates  a  congruence  relation  ~  and  u  is  an 
expression  covered  by  G  then  the  set  of  simplifications  of  u  under  ~  equals  the  set  of 
simplifications  of  u  under  ~p  where  F  is  the  maximal  generator  function  of  G. 

Proof.  First  note  that  an  expression  covered  by  G  is  a  simplicity  bound  under  ss  just  in  case  it  is  a 
simplicity  bound  under  ~p.  This  is  because  if  v  is  an  expression  covered  by  G  then  equals 
Ivljw  .  Second  note  that  if  v  is  an  expression  covered  by  G  then  since  G  is  normal  (and  therefore 

one  level)  each  subexpression  of  v  must  be  generated  by  some  non-terminal  symbol  of  G. 
I'herefore  each  subexpression  of  an  expression  covered  by  G  is  covered  by  G.  The  above  two 
observations  imply  that  an  expression  covered  by  G  is  simplified  under  ~  just  in  case  it  is 
simplified  under  25p.  Finally  note  that  every  expression  in  |u|~  is  covered  by  G  since  |u|^  is 
precisely  ihc  set  generated  by  the  non-terminal  symbol  F(u).  Thus  the  set  of  simplifications  of  u 
under  ~  consists  of  those  expressions  in  |u|~^  which  arc  simplified  under  ~p,  i.e.  the  set  of 

simplifications  of  u  under  ~p. 

The  following  theorem  finally  establishes  the  desired  result  concerning  the  existence  of 
simplifications  under  consistent  Boolean  formulae. 


Theorem  4.4\  For  any  consistent  Boolean  formula  Q.  any  well  behaved  simplicity  order,  and 
any  expression  u.  u  has  a  simplification  under  ~q. 

Given  lemmas  4.2  and  4.3  the  above  theorem  follows  from  the  following  lemma: 


XJ 


i 
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l.emmu  4.x  I.ct  G  be  any  normal  grammar  with  maximal  generator  function  I'.  I  or  any 
well  behaved  simplicity  order  every  expression  covered  by  G  has  a  simplification  under  ~  | 

The  above  lemma  will  be  shown  by  constructing  an  algorithm  which  takes  a  normal  grammar  and 
assigns  each  non-terminal  symbol  X  an  expression  s(X)  which  is  simplified  under  ~p  and  which  is  generated 
by  X.  11100  for  any  expression  u  covered  by  G,  s(r(u))  is  a  simplification  of  u. 

The  procedure  for  constructing  the  assignment  s  is  presented  below.  At  each  point  in  the 
computation  s  is  defined  on  a  subset  of  non-terminal  symbols.  Such  a  partial  assignment  s  can  be  extended 
to  expression  forms  of  the  form  <a^  ...  an>  by  setting  s(<aj  aj  ...  an>)  equal  to  <s(« jjslo^)  •••  s(an)> 

whenever  s(aj)  is  defined  for  each  aj. 

At  each  point  in  the  computation  there  is  also  a  queue  of  pairs  <X,  u>  where  X  is  a  non-terminal 
symbol  and  u  is  an  expression  over  terminal  symbols.  Each  pair  <X,  u>  on  this  queue  should  be  thought  of  as 
a  constraint  on  the  value  of  s(X)  which  states  that  L(s(X))  must  be  less  than  or  equal  to  L(u).  This  queue  is 
always  sorted  with  respect  to  the  terminal  expressions  such  that  for  any  two  pairs  <X,  u>  and  <Y.  v>  on  the 
queue  if  u  is  simpler  than  v  then  the  pair  <X,  u>  appears  earlier  on  the  queue  than  the  pair  <Y,  v>.  Thus  if 
<X.  u>  is  the  pair  at  the  head  of  the  queue  then  u  must  be  at  least  as  simple  as  any  other  terminal  expression 
appearing  in  any  other  pair  on  the  queue.  The  procedure  maintains  the  following  three  invariants: 

1)  For  each  non-terminal  X  such  that  s(X)  is  defined,  X  generates  s(X)  under  G. 

2)  For  each  pair  <X,  u>  on  the  queue  X  generates  u  under  G. 

3)  Let  <X,  u>  be  the  f-rst  pair  on  the  queue.  For  any  non-terminal  symbol  Y  if  s(Y)  is 
defined  then  L(s(Y))<Uu). 

The  procedure  is  given  below,  initially  s  is  totally  undefined. 

1)  For  each  production  of  the  form  X=*a  where  a  is  a  terminal  symbol  place  the  pair  <X.  a> 
on  the  queue. 

2)  Take  the  first  pair  <X,  u>  off  the  queue.  If  s(X)  is  undefined  do  the  following: 

i)  Set  s(X)  to  u. 

ii)  For  each  production  of  the  form  Y  =*a  such  that  X  appears  in  a  and  s(a)  is 
defined,  place  the  pair  <Y,  s(«)>  on  the  queue. 

3)  If  the  queue  is  not  empty  go  to  2) 

'llic  algorithm  terminates  because  there  arc  only  finitely  many  non-terminal  symbols  which  can  be 


4.  SIMI’LII  If  VI  ION 


-22- 


May  19)G 


assigned  simplifications.  The  first  two  invariants  arc  that  if  s(X)  is  defined  then  X  generates  MX)  and  if  <\.  ii> 
is  a  pair  on  the  queue  then  X  generates  U.  Ihesc  two  invariants  are  easily  shown  to  he  true  in  tire  initial  state 
since  the  function  s  in  initially  totally  undefined  and  every  pair  on  die  queue  is  of  die  form  <X.  a>  where  a  is  a 
terminal  symbol  and  X=>a  is  a  production  of  G.  Given  drat  these  invariants  hold  it  can  easily  be  shown  that 
no  step  of  the  procedure  causes  eidicr  invariant  to  be  violated. 

The  final  invariant  is  that  for  any  non-terminal  Y.  if  s(Y)  is  defined  and  <X,  u>  is  the  first  pair  on  the 
queue  then  L(s(Y))<l.(u).  Because  the  queue  is  sorted  if  <X.  u>  is  the  first  pair  on  the  queue  then  !.(u)  is  the 
least  simplicity  level  assigned  any  terminal  expression  on  die  queue.  Thus  the  final  invariant  is  equivalent  to 
the  statement  that  whenever  s(Y)  is  defined,  s(  Y)  is  at  least  as  simple  as  any  terminal  expression  on  the  queue. 
This  invariant  holds  trivially  in  the  initial  condition  since  the  function  s  is  totally  undefined.  Furthermore 
simply  removing  a  pair  from  the  queue  can  never  cause  this  invariant  to  be  violated.  Ihc  only  times  the 
invariant  might  be  violated  is  when  s(X)  is  defined  for  some  X  or  when  a  pair  </.,  w>  is  added  to  the  queue. 

Consider  an  execution  of  step  2)  of  the  procedure  in  which  a  pair  <X,  u>  is  removed  from  the  queue 
and  s(X)  gets  set  to  u.  Let  <Y,  v>  be  the  pair  which  is  next  on  the  queue  when  the  pair  <X,  u>  is  removed. 
Given  that  the  invariant  is  already  in  force  L(s(Z))  must  be  less  than  or  equal  to  l.(u)  for  any  non-terminal 
symbol  Z  such  that  s(Z)  was  defined.  Furthermore  l.(u)  is  less  than  or  equal  to  l.(v)  and  thus  l.(s(7.))  is  less 
than  or  equal  to  Uv)  for  any  non-terminal  symbol  Z  such  that  s(Z)  was  defined.  Further  since  L(u)  is  less 
tlian  or  equal  to  I.(v),  l.(s(X))  will  be  less  than  or  equal  to  l.(v)  when  s(X)  is  set  to  u.  llius  the  removal  of 
<X.  u>  from  the  queue  and  the  setting  of  s(X)  to  u  docs  not  violate  die  invariant. 

Now  consider  die  queueing  of  any  pair  of  the  form  <Z,  s («)>.  Let  <Y.  v>  be  the  first  pair  on  die 

queue  before  the  pair  <Z,s(a)>  is  added.  For  every  non-terminal  symbol  W  such  that  sCvY*  ;s  defined 

I  ,(s(W))<L(v).  Given  this  fact  the  monotonicity  condition  on  the  simplicity  order  can  be  u*\i  to  show  (via  3 

simple  induction  on  expression  forms)  that  for  any  expression  form  a  such  that  s(a)  is  defined  L(s(«))<L(v). 

/ 

Thus  when  the  pair  <Z,  s (a)>  is  queued  l.(s(a))  is  less  than  or  equal  to  |j(v)  so  the  simplicity  level  of  die  first 
pair  on  the  queue  remains  the  same  and  the  invariant  is  not  violated. 

It  will  now  be  shown  that  if  X  is  any  coherent  non-terminal  (i.c.  the  language  generated  by  X  is  not 
empty)  then  s(X)  gets  defined  by  the  above  procedure  and  that  s(X)  is  a  simplicity  bound  under  It  is 
sufficient  to  show  by  induction  on  expressions  that  if  v  is  any  expression  covered  by  G  then  s(T(v))  gets 
defined  and  Msfr(v)))<^|j(v).  If  v  is  a  terminal  symbol  covered  by  G  then  the  pair  <T(v).  v>  will  be  placed  on 
the  queue  in  step  1)  of  the  procedure.  Ibis  ensures  that  s(T(v))  will  become  defined  and  that 
l.(s<r<v)))<L(v).  Now  suppose  v  is  an  expression  covered  by  G  which  is  of  the  form  <vj  v2 ...  vn>.  Since  v  is 
covered  by  G  each  Vj  must  also  be  covered  by  G  and  die  production  r(v)=><P(Vj)  P( v^) ...  r(\'n)>  must  be  a 
production  of  G.  Since  each  Vj  is  covered  by  G  die  induction  hypodiesis  implies  that  s(f(Vj))  gets  defined  and 
that  I  (s(r(Vj)))<L(V|).  Thus  the  monotonicity  condition  on  the  simplicity  order  implies  that: 

u<s(r(vj))s<nv2)) ...  scr(vn))»<L«v,  v2 ...  vn». 

Now  since  the  production  r(v)=*<f(vj)  r(v2)...  L(vn)>  is  in  G  and  since  s<f(Vj))  gets  defined  for 


4.  SIMPLIFICATION 


•23- 


May  1983 


each  Vj  the  pair  <l'(v)<s<r(vj)) sO'(v2)) ... s( l'(v n))>)>  gets  placed  on  the  queue.  Therefore  stl'lv))  gets 
defined  such  that: 


UsiRv)))  <  i.«s(r(v,))s(r(v2))  ...s(r(vn))» 

Combining  the  above  two  relations  we  get: 

L(s(r(v))  <  Uv) 

It  has  been  shown  that  for  each  coherent  non-terminal  symbol  X  that  s(X)  gets  defined  and  that  s(X) 
is  a  simplicity  bound  under  ~p.  Note  that  given  the  way  pairs  arc  placed  on  the  queue  and  the  way  s(X)  gets 
assigned,  if  s(X)  is  not  a  terminal  symbol  then  it  can  be  written  as  <s(Y  j)  s(Y 2 ) ...  s(Yn)>  where  each  Yj  is  a 
non-terminal  symbol.  Thus  every  subexpression  of  s(X)  is  also  a  simplicity  bound  and  therefore  s(X)  is 
simplified  under  ~p.  This  implies  that  for  any  expression  v  covered  by  G,  s(r(v))  is  a  simplification  of  v. 


In  analyzing  the  running  time  of  this  algorithm  let  |G|  be  the  total  size  of  the  grammar  G.  For  each 
pair  placed  on  the  queue  there  is  a  particular  production  X=>a  of  G  such  that  u  is  s(a).  A  given  production 
can  be  responsible  for  at  most  the  queueing  of  one  pair  so  that  the  number  of  pairs  placed  on  the  queue  is 
order  |G|.  The  time  required  to  queue  and  remove  order  |G|  elements  from  a  sorted  queue  (a  priority  queue) 
is  order  |G|log(|G|)  plus  the  time  it  takes  to  compute  order  |G|log(|G|)  comparisons.  A  given  production 
X=><Xj  X2  ...  Xn>  can  be  examined  by  part  ii)  of  step  2)  of  the  algorithm  at  most  n  times.  Thus  tire  total 
time  spent  in  part  ii)  of  step  2  (ignoring  queueing  time)  is  order  |G|.  Thus  the  total  time  taken  by  the  above 
procedure  is  order  |G|log(|G|)  plus  the  time  it  takes  to  perform  order  |G|log(|G|)  comparisons  between 
expressions. 


The  next  theorem  provides  a  representation  for  the  set  of  simplifications  of  an  expression  under  the 
congruence  relation  imposed  by  a  normal  grammar  G.  It  turns  out  that  the  set  of  simplifications  can  be 
described  by  a  grammar  which  is  a  subset  of  the  productions  of  G. 

Lemma  4.6:  For  any  well  behaved  simplicity  order  and  normal  grammar  G  with  maximal 
generator  function  T  there  is  a  subset  G’  of  the  productions  of  G  such  that  for  any  expression 
w  the  set  of  simplifications  of  w  under  ~  p  is  the  language  generated  by  T(w)  under  G\ 

•  • 

Proof:  Without  loss  of  generality  it  may  be  assumed  that  every  non-terminal  of  G  is  coherent. 

Thus  for  each  non-terminal  symbol  X  of  G,  MX)  can  be  defined  to  be  the  simplification  level  of 
any  simplicity  bound  generated  by  X,  i.e.  MX)  is  the  minimum  simplicity  level  assigned  any 
expression  generated  by  X.  1110  totalizer  1.  can  be  represented  as  a  triple  <l)  l.j  L2>  as  described 
above.  Given  such  a  representation  Mar)  can  be  defined  for  an  arbitrary  expression  form  a  via  the 
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following  relation: 


U<aj  a2  -  «n>)  =  I  2(<l.(a|)  Ua2)  •  •  U<*n)>) 


The  monotonicity  condition  on  the  simplicity  order  ensures  that  in  general  Ua)  is  the 
minimum  level  assigned  any  expression  generated  by  an  expression  form  a. 

Let  G’  be  that  subset  of  the  productions  X=*a  of  G  such  that  L(a)  equals  l.(X).  Now  it  will  be 
shown  by  induction  on  expressions  that  for  any  expression  v  covered  by  G,  if  T(v)  generates  v 
under  G’  then  Uv)  equals  URv)).  For  any  expression  v  covered  by  G.  T(v)  equals  some 
non-terminal  symbol  X  of  G.  If  v  is  a  constant  symbol  covered  by  G  then  if  X  generates  v  then  the 
production  X=»v  is  in  G’  and  by  the  definition  of  G’  this  implies  L(v)  equals  L(X).  If  v  is  of  the 
form  <vj  v2 ...  vn>  and  X  generates  v  under  G’  then  there  must  be  a  production  X=><X  j  X2 ...  Xn> 
of  G’  such  that  Xj=»*vj  for  each  i  and  thus  by  the  definition  of  O'  the  following  relations  must 
hold: 

MX)  =  L(<X1X2„.Xn>) 

=  L2«L(Xl)L(X2)...L(Xn)» 
and  by  the  induction  hypothesis: 

=  L2«Mv1)Uv2)...Uvn)» 

=  Uv) 

It  will  now  be  shown  that  for  any  expression  form  a.  if  a=>*v  under  G’  then  L(v)  equals  L(a). 
This  is  done  by  induction  on  expression  forms.  If  a  is  a  terminal  symbol  then  if  a=»*v  under  G’ 
then  a  must  equal  v  and  the  result  is  trivial.  If  a  is  a  non-terminal  symbol  and  a=**v  under  G’ 
then  T(v)  must  equal  a  and  the  result  follows  from  the  above  induction.  If  a  is  of  the  form 
<a  j  rr2 ...  an>  then  v  must  of  the  form  <vj  v2  ...vn>  where  aj=>*Vi  for  each  aj.  In  this  case  the 
result  follows  from  the  following  relations: 

L(a)  =  L(<aj  a2...an>) 


=  L 2KUaj)  Ua2) ...  Uon)>) 

and  by  the  induction  hypothesis: 


=  L2«Uv1)Uv2)...Uvn)>) 
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It  follows  from  the  above  induction  that  if  r(v)=>*v  under  G'  then  l.(v)  is  the  least  level 
assigned  any  expression  generated  by  f'(v)  under  G  and  therefore  v  is  a  simplicity  bound  under 
~p.  It  will  now  be  show  that  if  r(v)=»*v  under  G‘  then  v  is  simplified  under  ~p.  form  the 
above  observations  it  is  sufficient  to  show  that  if  T(v)=**v  under  G’  and  if  v  is  of  the  form 
<vjV2...  vn>  then  for  each  Vj,  r(vj)=>*Vj  under  G\  The  expression  form  P(v)  is  either  a 
non-terminal  symbol  or  the  n-tuplc  <T(v1)  r(v2) ...  Rvn)>.  In  the  latter  ease  T(Vj)=>*Vj  for  each 
Vj.  In  the  former  case  since  G’  is  a  one  level  grammar  each  vj  must  be  generated  under  G’  by  some 
non-terminal  symbol  of  G,  but  the  only  non-terminal  which  could  possibly  generates  v^  under  G'  is 
Rv:)  and  therefore  r(Vj)=»*vj  for  each  vj. 

For  any  maximal  expression  form  a  die  language  generated  by  a  under  G  is  an  equivalence 
class  of  Sp.  Furthermore  it  has  been  shown  that  if  a=>*v  under  G'  then  v  is  simplified  under 
~p.  Thus  for  any  expression  u  the  language  generated  by  Ru)  under  G’  is  a  subset  of  the 
simplifications  of  u.  It  now  remains  only  to  show  that  every  simplification  of  u  is  generated  by  T(u) 
under  G\ 

It  will  be  shown  by  induction  on  expressions  that  if  v  is  an  expression  which  is  simplified  under 
~p  then  T(v)=>*v  under  G\  If  v  is  a  constant  symbol  which  is  simplified  under  ~p  then  either 
T(v)  is  v  in  which  case  the  result  is  trivial  or  Rv)  is  a  non-terminal  symbol  such  that  the  production 
T(v)=>v  is  in  G.  But  since  v  is  simplified  under  ~p  L(v)  must  equal  LfRv))  and  thus  the 
production  T(v)=>v  is  in  G’.  Now  let  v  be  a  simplified  expression  of  the  form  <vj  v2  ...  vn>.  since 
v  is  simplified  so  is  each  vj  and  therefore  by  the  induction  hypothesis  for  each  Vj,  r(vj)=>*v-  under 
G’.  Furthermore  Rv)  is  either  <r(vj)  Rv2) ...  T(vn)>  or  a  non-terminal  symbol  X  such  that  the 
production  X=*<r(vj)  T(v2) ...  T(vn)>  is  in  G.  Thus  it  is  sufficient  to  prove  that  in  the  case  where 
Rv)  is  a  non-terminal  symbol  X  the  production  X=*<T(vj)  r(v2) ...  Rvn)>  is  in  G\  or  equivalently 
to  prove  that  L(<T(vj)  Rv2) ...  T(vn)>)  equals  L(X).  However  since  v  is  simplified  I.(X)  equals 
L(v)  and  furthermore  since  each  Vj  is  simplified  LfRvj))  equals  L(vj).  This  implies  the  following 
relations  which  establish  the  desired  result: 

i.«r(v1)r(v2)...r(vn)» 

=  L2«L(r(Vl))  ur(v2» ...  ur(vn»» 

=  L2«Uv1)L(v2)...Uvn)» 

=  L(v) 

=  UX) 

It  is  important  to  note  that  the  subset  G’  of  the  productions  of  G  can  be  computed  in  a 
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straight  forward  way  given  an  assignment  s  of  simplified  expressions  to  non-terminal  symbols  such  as  the  one 
produced  in  the  procedure  described  earlier.  Specifically  a  production  X=>«  is  in  G'  just  in  case  s(X)  is  not 
simpler  than  s(a). 

I  he  following  theorem  has  now  been  established: 

Theorem  4.7:  For  any  consistent  equational  Boolean  formula  Q,  any  expression  u,  and  any 
well  behaved  simplification  order  the  set  of  simplifications  of  u  under  ~q  is  a  context  free 
expression  language. 
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5.  CONCLUSIONS  AND  OPEN  PROBLEMS 

The  notion  of  an  expression  is  more  general  than  the  notion  of  first  order  term.  Many  different 
logical  formalisms  consist  at  least  in  part  of  expressions  in  which  the  substitution  of  equals  for  equals  is  valid. 
Thus  the  concepts  and  algorithms  discussed  here  can  be  used  for  valid  inference  in  a  wide  variety  of  deductive 
systems. 

The  relationship  which  has  been  established  between  finite  sets  of  equations  and  context  free 
expression  grammars  may  have  applications  beyond  the  deduction  techniques  and  simplification  algorithms 
discussed  here.  For  example  consider  unification  under  a  finitely  cquational  congruence  relation.  More 
precisely  let  u  and  v  be  two  expressions  containing  variables  and  let  2  be  a  finite  set  of  equations  between 
ground  expressions.  For  any  substitution  a  and  expression  u  let  <r(u)  be  the  result  of  simultaneously 
replacing  each  variable  of  u  with  its  image  under  a.  The  unification  problem  for  the  finite  set  of  equations  2 
is  to  characterize  the  set  of  ground  substitutions  <r  such  that  a(u)~^o(v).  An  obvious  first  step  in 
approaching  this  problem  is  to  consider  a  grammar  G  which  represents  2  and  consider  substitutions  over  the 
maxima]  expression  forms  of  G  (remember  that  each  maximal  expression  form  of  G  represents  an 
equivalence  class  under  2).  The  details  of  a  unification  algorithm  have  not  been  worked  out  and  it  is  not  clear 
what  the  complexity  of  this  problem  is.  Another  extension  would  be  to  develop  a  unification  algorithm  for 
cquational  Boolean  formulas  (remember  that  for  a  Boolean  formula  Q.  ~q  need  not  be  finitely  cquational). 

It  is  hoped  that  the  material  presented  here  will  be  applicable  to  various  domains  including  program 
verification,  compiler  optimization  techniques,  and  deductive  data  bases. 
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